CDS

Accession Number TCMCG078C06519
gbkey CDS
Protein Id KAG0457993.1
Location complement(join(32076624..32076986,32077349..32077467,32077567..32077642,32077743..32077907,32082879..32082959,32083047..32083127,32083264..32083329,32083438..32083503,32088299..32088349,32088455..32088547,32088645..32088772,32088880..32089018,32099282..32099379,32099491..32099560,32099683..32099772))
Organism Vanilla planifolia
locus_tag HPP92_023150

Protein

Length 561aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA633886, BioSample:SAMN14973820
db_source JADCNL010000012.1
Definition hypothetical protein HPP92_023150 [Vanilla planifolia]
Locus_tag HPP92_023150

EGGNOG-MAPPER Annotation

COG_category TU
Description Clathrin assembly protein
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko04131        [VIEW IN KEGG]
KEGG_ko ko:K20043        [VIEW IN KEGG]
EC -
KEGG_Pathway -
GOs GO:0005575        [VIEW IN EMBL-EBI]
GO:0005623        [VIEW IN EMBL-EBI]
GO:0005886        [VIEW IN EMBL-EBI]
GO:0016020        [VIEW IN EMBL-EBI]
GO:0044464        [VIEW IN EMBL-EBI]
GO:0071944        [VIEW IN EMBL-EBI]

Sequence

CDS:  
ATGGCGACGATGCAGAGTTGGCGGAAGGCCTACGGTGCTCTCAAGGATTCGACGACCGTTGGTCTAGCTAATCTCAATAGCGATTTCAAGGAGCTGGACGTTGCCATCGTCAAGGCAACGAATCACGTTGAGTGCCCGCCAAAGGAGAGGCATATCAGAAAGATCTTGGCCGCTACTCTCATATCGCGGCCGCGAGCTGATGTGGCCTATTGCATTCGTGCGCTAGCGAGACGGCTGTCAAAAACGCACAATTGGACGGTAGCTTTGAAGACACTAATAGTTATACATAGGTCTTTAAGGGAGGGTGATCCAACTTTCCGTGAAGAGTTTCTCAATTTTTCTCAAAGGGCAGGCATTCTTCAATTATCAAACTTCAAGGATGATTCAAGTCCTATAGCTTGGGATTGCTCTGCATGGGTTAGAACTTATTCTTTATTTCTGGAGGAAAGATTAGAGTGCTTCCGGATATTAAAGTATGATGTCGAAGCTGAACGCTTAGTCAAACCTGCTGATGGTTCGGAAAAGGGCCACAGTAGAACAAGGGATTTAAATTCGGAGGAGTTGTTACTGCAGTTACCTGCACTTCAACAATTGCTTTACCGGCTTATAGGATGTTGTCCCGAAGGAGCTGCGATCAATAATTATGTAATTCAGTATGCCTTAGCTTTGGTTTTAAAAGAAAGTTTCAAGATATATTGTGCTATTAATGATGGCATCATCAATCTTGTTGATAAGTTCTTTGAGATGCCAAGACATGAAGCAGTCAAAGCCCTTGATATCTACAGAAAAGCTGGTCAACAGGCTATTAATCTCTCTGAATTTTATGAGGTATGTAGAGGATTAGAGCTTGCTAGGAATTTCCAGTTTCCAAATTTGAGAGAGCCCCCACAATCATTTCTTGCAACCATGGAAGAATATATAAGAGAGGCTCCACGAGTAGTTTCTGTTCCCAGTGAGCCTCTGGAATTTCCTGAGAGACTTCTTTTGACATACAAACAACCAGAGGATGCTCCTACTGTTGTTGAGGATGAAAAACCATTTGATGAAGGCACAAAGCAGGAACCTTCTCATGTAGAGTTTGAAGCTGCACCTGGCCCACACCAGCAGGAAGATACTGGAGATTTGTTGGGATTGAATGACTTTAATCCTGGTGCATCTGCAATAGAAGAAAGCAATGCATTGGCTCTAGCAATTGTTCCATCCGACATTCCCTCCAGTAATTCTGAGGCAGTTCATGAAAAATCATTTGATCCATCTGGATGGGAGCTGGCCCTCGTTTCTACAACTAGTACTATTAACTCATCTGCAGTTGAAAGCCAGCTGGGTGGTGGCTTTGATAAGCTTACATTGGATAGCTTGTATGATGATGGCGCATATAGACAGCAGCAACAACAACATTTCTATGGTCCTCCTGCGCCAAATCCCTTCCTGACTGATCCATTTGCTGTATCAAACCAAGTAGCTGCTCCTCCAGCCGTGCAGATGGCAGCTTTGTCTCAGCATCAGTCCTTCGTGATCCAGTCCAACCCTTTCATGCAGCCATTGCCCGCAGGGCATCAGCAACCCCTGGTGGCTGGGATTCCCGCCGCGAACCCCTTTGCAGACACAACTGGTTTTGGGACTTTTACGGCGGCAAACACGACCCACTACCAAAGCAATCCTTTTGGAAGCACACAGCTGCTTTAG
Protein:  
MATMQSWRKAYGALKDSTTVGLANLNSDFKELDVAIVKATNHVECPPKERHIRKILAATLISRPRADVAYCIRALARRLSKTHNWTVALKTLIVIHRSLREGDPTFREEFLNFSQRAGILQLSNFKDDSSPIAWDCSAWVRTYSLFLEERLECFRILKYDVEAERLVKPADGSEKGHSRTRDLNSEELLLQLPALQQLLYRLIGCCPEGAAINNYVIQYALALVLKESFKIYCAINDGIINLVDKFFEMPRHEAVKALDIYRKAGQQAINLSEFYEVCRGLELARNFQFPNLREPPQSFLATMEEYIREAPRVVSVPSEPLEFPERLLLTYKQPEDAPTVVEDEKPFDEGTKQEPSHVEFEAAPGPHQQEDTGDLLGLNDFNPGASAIEESNALALAIVPSDIPSSNSEAVHEKSFDPSGWELALVSTTSTINSSAVESQLGGGFDKLTLDSLYDDGAYRQQQQQHFYGPPAPNPFLTDPFAVSNQVAAPPAVQMAALSQHQSFVIQSNPFMQPLPAGHQQPLVAGIPAANPFADTTGFGTFTAANTTHYQSNPFGSTQLL